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In general, prokaryotes are considered to be single-celled organisms that lack internal membrane- 
bound organelles. However, many bacteria produce proteinaceous microcompartments that serve 
a similar purpose, i.e. to concentrate specific enzymic reactions together or to shield the wider 
cytoplasm from toxic metabolic intermediates. In this paper, a synthetic operon encoding the key 
structural components of a microcompartment was designed based on the genes for the 
Salmonella propanediol utilization (Pdu) microcompartment. The genes chosen included pduA, 
-B, -J, -K, -A/, -7and -U, and each was shown to produce protein in an Escherichia co// chassis. In 
parallel, a set of compatible vectors designed to express non-native cargo proteins was also 
designed and tested. Engineered hexa-His tags allowed isolation of the components of the 
microcompartments together with co-expressed, untagged, cargo proteins. Finally, an in vivo 
protease accessibility assay suggested that a PduD-GFP fusion could be protected from 
proteolysis when co-expressed with the synthetic microcompartment operon. This work gives 
encouragement that it may be possible to harness the genes encoding a non-native 
microcompartment for future biotechnological applications. 



INTRODUCTION 

Compartmentalization of biochemical processes is an 
essential feature of all cellular systems. The occurrence 
of membrane-bound organelles is well documented for 
eukaryotic systems and it is well established that Gram- 
negative bacteria house specialized biochemical processes 
in their periplasms. In recent years, however, it has become 
increasingly clear that proteinaceous subcellular compart- 
ments akin to organelles are utilized by some prokaryotes 
and that they serve to partition specific metabolic pathways 
from the bulk cytoplasm (e.g. Kerfeld et ah, 2010). The first 
bacterial microcompartment (BMC) described was the 



Abbreviations: BMC, bacterial microcompartment; BMV, bacterial 
microcompartment vertex; HA, haemagglutinin; IMAC, immobilized metal 
affinity chromatography; RBS, ribosome-binding site. 

One supplementary table is available with the online version of this 
paper. 



carboxysome, which is highly active in the fixation of 
carbon dioxide and encapsulates two proteins: ribulose- 
1,5-bisphosphate carboxylase/oxygenase and carbonic 
anhydrase (Shively et al, 1973; So et al, 2004). More 
recently, BMCs have been associated with other metabolic 
processes such as 1,2 -propanediol catabolism and ethanol- 
amine degradation (Bobik et ah, 1999; Kofoid et a/., 1999) 
- pathways that require the activity of more than five 
different enzymes plus associated cofactors. 

BMCs have a polyhedral organization and are assembled 
from 'shell proteins' containing one or two copies of a 
conserved BMC domain (Pfam ID: PF00936). These 
proteins are organized as circular hexamers (or pseudo- 
hexamers for those with tandem BMC domains) (Crowley 
et aU 2008; Kerfeld et al, 2005; Sagermann et ah, 2009). At 
least some of the hexameric shell proteins from the 
different BMCs appear to contain central pores that may 
facilitate exchange of small metabolites between the 
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compartment and the cytoplasm (Kerfeld et ah, 2005; 
Sagermann et al, 2009). These hexameric arrangements are 
assumed to form the faces of the polyhedron, with the 
vertices (Kerfeld et al, 2005) being formed from a 
pentameric protein of the BMC vertex (BMV) family 
(Tanaka et ah, 2008; Wheatley et aU 2013). 

How specific proteins and enzymes are targeted to the 
interior of a BMC is not fully understood; however, in 
some cases a short, uncleaved N-terminal signal sequence 
has been identified on proteins destined for encapsulation 
(e.g. Fan et al, 2010; Parsons et al, 2010). Recently, it was 
demonstrated that the targeting sequence of the propion- 
aldehyde dehydrogenase (PduP) protein from Salmonella 
enterica (hereafter Salmonella) interacts with a short helix 
present at the C-terminus of PduA, one of the shell 
proteins of the BMC dedicated to propanediol utilization 
(Pdu) (Fan et al, 2012). This finding points to a model 
whereby proteins are encapsulated into the BMC during its 
assembly, rather than being targeted into the fully formed 
BMC post-assembly (Fan et al, 2012). 

Since their discovery, BMCs have attracted interest for their 
potential biotechnological applications (e.g. Frank et ah, 
2013). The possibility to increase the efficiency, or rate of 
flux, of a biological pathway through encapsulation, and 
thus concentration, within a microcompartment remains 
an attractive one and this would also allow for the shielding 
of potentially toxic metabohtes from the cell cytoplasm. It 
would be of potentially great interest to design and build a 
minimalist, empty, synthetic BMC that would be consti- 
tutively expressed and adaptable to any number of possible 
applications. In parallel with the synthetic BMC, compat- 
ible protein targeting tools would also have to be designed 



and tested so that enzymes of choice could be packed 
within any synthetic BMC so developed. In this study, 
some progress has been made towards those aims. The 
Salmonella pdu system was used as a template and seven 
genes from that cluster (pduA, -B, -/, -X, -iV, -Tand -LO, 
which encode the known shell proteins, were assembled 
into a synthetic operon. The synthetic BMC proteins were 
shown to be produced in an Escherichia coli K-12 chassis, 
which is one of the current organisms of choice for 
bioengineering studies (Keasling, 2008; Lee et al, 2008). It 
was demonstrated that attachment of the N-terminal 20 aa 
of PduD is sufficient to allow co-purification of hetero- 
logous proteins with the protein components of the BMC. 
Finally, an in vivo protease accessibility assay suggests a 
proportion of a PduD-linked reporter protein is protected 
from proteolysis when co-expressed with the BMC genes. 

METHODS 

Bacterial strains and growth conditions. All DNA manipulations 
were carried out using E. coli strain DH5a [<I>80d/acZAM15 recAl 
endAl gyrA96 thi-1 hsdR17{r^ mf ) supE44 relAl deoR A{lacZYA- 
argE)U169]. For expression of radiolabelled gene products from 
plasmids under control of the phage T7 010 promoter, plasmids were 
transformed into E. coli strain K38 (HfrC phoA4 pit- 10 tonA22 
ompF627 relAl) (Lyons & Zinder, 1972), which carries the compatible 
plasmid pGPl-2 (Kan^) coding for the T7 RNA polymerase (Tabor & 
Richardson, 1985). Protein purification and fluorescence microscopy 
were carried out using plasmids transformed into strain MG1655 (F~ 
X~ ilvG~ rfb-50 rph-1) (Blattner et a/., 1997). 

Plasmid construction. A Ust of all the plasmids constructed in this 
study is provided in Table 1. To amplif)^ the Salmonella pduAB 
genes whilst supplying a 5'-CACAGAGGAACAGGT-3' linker, which 
includes an artificial ribosome-binding site (RBS) and a six-base 



Table 1. Plasmids utilized in this work 



Plasmid 


Relevant features 




Source 


pUNI-PROM 


Cloning vector for expression of genes under the control of the tat and T7 promoters; 
Amp^ 


Jack 


et al (2004) 


pUNI-AB(Pst) 


Produces PduA and PduB^^^; two native Pstl restriction sites present in pduB 


This 


study 


pUNI-AB 


As pUNIPROM-AB(Pst) except Pstl restriction sites removed from pduB 


This 


study 


pUNI-JK 


Produces PduJ and PduK^'' 


This 


study 


pUNI-N 


Produces PduN^^' 


This 


study 


pUNI-TU(Pst) 


Produces PduT and PduU^^^; one native Pstl restriction site present in pduU 


This 


study 


pUNI-TU 


As pUNIPROM-TU(Pst) except that Pstl site removed from pduU 


This 


study 


pUNI-ABTU 


Produces PduA, PduB^'', PduT and PduU^'' 


This 


study 


pUNI-ABTUN 


Produces PduA, PduB^'', PduT, PduU^'' and PduN^'' 


This 


study 


pUNI-ABTUNJK 


Produces PduA, PduB^^^ PduT, PduU^K PduN^^^ PduJ and PduK^^' 


This 


study 


pSU-PROM 


Cloning vector for expression of genes under the control of the tat promoter; Km^ 


Jack 


et al (2004) 


PSU-D20 


Encodes the first 20 aa of PduD 


This 


study 


PSU-D40 


Encodes the first 40 aa of PduD 


This 


study 


PSU-D20-GFP 


Encodes the first 20 aa of PduD fused to GFP 


This 


study 


pSU-D2o-mCherry(Pst) 


Encodes the first 20 aa of PduD fused to C-terminally HA-tagged mCherry; 
native Pstl restriction site present in mcherry 


This 


study 


pSU-D2o-mCherry 


As pSUPROM-D2o-niCherry(Pst) except Pstl restriction site removed from mcherry 


This 


study 


pSU-D4o-GFP-SsrA 


Encodes the first 40 aa of PduD fused to GFP with a C-terminal SsrA tag 


This 


study 
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spacer to the initiation codon, at the 5' end of pduA and a hexa-His 
tag coding sequence to the 3' end oipduB, oUgonucleotides PduAB-1 
(5'-GCGCTGATCACACAGAGGAACAGGTATGCAACAAGAAGC- 
ACTAGGAATGG-3') and PdiiAB-2 (5'-GCGCAAGCTTCTGCAGG- 
GATCCTTAGTGATGGTGATGGTGATGGATGTAGGACGGACGA- 
TCGTTTTTCGG-3') were used with S. enterica serovar Typhimurium 
LT2 genomic DNA as template (McClelland et aU 2001). To amplify 
the pduJK pairingy supplying a hexa-His tag coding sequence to the 3' 
end of pduKy oligonucleotides PduJK-1 (5'-GCGCTGATCACACAGA- 
GGAACAGGTATGAATAACGCACTGGGACTGGTTG-3') and PduJK-2 
(5'-GCGCAAGCTTCTGCAGGGATCCTTAGTGATGGTGATGGT- 
GATGCGCTTCACCTCGCTTGCCGGAATGAATGC-3') were used. 
To amplify the pduN with a 3' hexa-His tag coding sequence, 
oligonucleotides PduN-1 (5'-GCGCTGATCACACAGAGGAACAG- 
GTATGCATCTGGCACGAGTCACGGGCG-3') and PduN-2 (5'-G- 
CGCAAGCTTCTGCAGGGATCCTTAGTGATGGTGATGGTGATG- 
ACACGAAAGCGTATCTACAATGCCG-3') were used. To amplify the 
pduTUpairingy supplying a hexa-His tag coding sequence to the 3' end 
of pduUy oligonucleotides PduTU-1 (5'-GCGCTGATCACACAGA- 
GGAACAGGTATGTCTCAGGCTATAGGAATTTTAGAACTCACC-3') 
and PduTU-2 (5'-GCGCAAGCTTCTGCAGGGATCCTTAGTGATGG- 
TGATGGTGATGCGTCCGGGTGATCGAGCAAGTGGTG-3') were uti- 
lized. In each case the resultant PGR products were digested with BcR 
and Hi^dlll, and cloned into 5amHI-Hz^dIII-digested pUNI-PROM 
(Jack et aU 2004) to give plasmids pUNIPROM-AB(Pst), pUNIPROM- 
JK, pUNIPROM-N and pUNIPROM-TU(Pst), respectively. QuikChange 
site-directed mutagenesis was then employed to sequentially remove two 
Pstl sites from the coding sequence of pduB in plasmid pUNIPROM- 
AB(Pst) using the primer pairs PduB-pst-l-l/PduB-pst-1-2 (5'-CGGC- 
TATGGCAGAAAAAAGCTGTAGTTTAACGGAATTTGTCGGG-3'; 5'- 
CCCGACAAATTCCGTTAAACTACAGCTTTTTTCTGCCATAG- 
CCG-3') and PduB-pst-2-l/PduB-pst-2-2 (5'-GCCGGACATATC- 
GAGCTGCAATACACCGCTCGCGCCAGC-3 ' ; 5 ' -GCTGGCGCGA- 
GCGGTGTATTGCAGCTCGATATGTCCGGC-3') to give plasmid 
pUNIPROM-AB. Site-directed mutagenesis was also used to remove 
the single Pstl site from the coding sequence of pduU in plasmid 
pUNIPROM-TU(Pst) using the primer pair PduU-pst-l/PduU-pst-2 
(5'-CTCTTTAAGAAGCTGGGCCTGCAAGATGCAGTGTCCGCCAT- 
TGGC-3 ' ; 5 ' -GCCAATGGCGGACACTGCATCTTGCAGGCCCAGC- 
TTCTTAAAGAG-3'). 

To concatenate the cloned pdu genes, the pduTU §,enes were amplified 
using the PduTU-1 and PduTU-2 primers and pUNIPROM-TU as 
template, digested with Bell and Hindllly and cloned into BamHl- 
Hindlll pUNIPROM-AB, to give pUNIPROM-ABTU. To add pduN 
to this, the pduN gene was amplified with PduN-l/PduN-2, digested 
with BeR and Hindllly and cloned into BamHl-Hindlll pUNIPROM- 
ABTU, to give pUNIPROM-ABTUN. Finally, to add the pduJK genes 
they were amplified with PduJK-l/PduJK2, digested with Bell and 
Hindllly and cloned into BamHl-Hindlll pUNIPROM-ABTUN, to 
give pUNIPROM-ABTUNJK. 

For construction of the plasmids coding for the first 20 or 40 aa of PduD 
(encompassing the predicted BMC targeting sequence) oligonucleotide 
PduD-5^/II-For (5'-GCGCAGATCTCACAGAGGAACAGGTATGGAA- 
ATTAATGAAAAATTGCTGCGCC-3'), which includes the same 5'- 
CACAGAGGAACAGGT-3' RBS and linker employed for the BMC 
cloning, was used along with either PduD20-rev-Xba (5'-GCG- 
CTCTAGACTTCATATCGCGGAGTACGTCTTC-3') or PduD40-rev- 
Xba (5'-GCGCTCTAGAAGCGGTCTGTGGTGCTGTGGATGC-3') with 
Salmonella chromosomal DNA as PGR template. The resultant PGR 
products were digested with BgUl and Xbaly and separately cloned into 
BamHI-X?7aI-digested pSUPROM (Jack et a/., 2004) to give pSUPROM- 
D20 and PSUPROM-D40, respectively. A gene encoding GFP from plasmid 
pTGS (DeLisa et aU 2002) was amplified, lacking its initiation codon, with 
oligonucleotides GFP Primer 1 (5'-GCGCTCTAGAAGTAAAGGAGA- 
AGAACTTTTCACTG-3') and GFP Primer 2 (5'-GCGCAAGCTTCTGC- 
AGGGATCCTTACGCATAGTCCGGCACATCGTACGGATATTTGTA- 
TAGTTCATCCATGCCATGT-3'), digested with Xhal and Hi^dlll, 



and cloned into similarly digested PSUPROM-D20 to give pSUPROM- 
D20-GFP. The same GFP gene encoding a C-terminal SsrA tag was released 
from plasmid pTGS by digestion with Xhdi and Hi^dlll, and cloned into 
similarly digested PSUPROM-D40 to give pSUPROM-D4o-GFP-SsrA. A 
gene encoding mCherry, lacking an initiation codon, was amplified with 
oligonucleotides mCherry Primer 1 (5'-GCGCTCTAGAGTGAGCAAG- 
GGCGAGGAGGATAACA-3') and mCherry Primer 2 (5'-GCGCAAG- 
CTTCTGCAGGGATCCTTACGCATAGTCCGGCACATCGTACGGAT- 
ACTTGTACAGCTCGTCCATGCCGCCG-3'), digested with Xhal and 
Hindllly and cloned into similarly digested PSUPROM-D20 to give 
pSUPROM-D2o-mCherry(Pst). Subsequently, the native Pstl site was 
removed from the mCherry coding sequence by site-directed mutagenesis 
using oligonucleotides mCherry PsA site 1 /mCherry Pstl site 2 (5'-AC- 
CCAGGACTCCTCCCTACAGGACGGCGAGTTCATC-3 ' ; 5 ' -GATGA- 
ACTCGCCGTCCTGTAGGGAGGAGTCCTGGGT-3'). The subsequent 
plasmid was designated pSUPROM-D2o-niCherry. 

All constructs made during this study were sequenced on both strands 
to ensure that no undesired mistakes had been introduced during the 
amplification procedure. 

Protein methods. For expression tests of pdu genes encoded by 
plasmids pUNIPROM-AB, pUNIPROM-JK, pUNIPROM-N, 
pUNIPROM-TU, pUNIPROM-ABTU, pUNIPROM-ABTUN and 
pUNIPROM-ABTUNJK, which are under control of the phage T7 
010 promoter, each plasmid was used to transform E. eoli strain K38/ 
pGPl-2 (Tabor & Richardson, 1985). Synthesis of plasmid-encoded 
gene products was induced by a temperature shift from 30 to 42 °C 
and followed by labelling with p^S]methionine/cysteine mixture as 
described previously (Coulthurst et al, 2012; Tabor & Richardson, 
1985). Samples were separated by SDS-PAGE (12% w/v acrylamide) 
after which gels were fixed in 5% (v/v) acetic acid, 10% (v/v) 
methanol, dried and proteins visualized by autoradiography. 

Isolation of His-tagged Pdu complexes was carried out by immobilized 
metal affinity chromatography (IMAC). Briefly, 5-10 1 of culture was 
grown overnight in static 5 1 Duran bottles containing LB medium 
supplemented with 0.4% (w/v) glucose. Cells were harvested by 
centrifugation and washed once in 50 mM Tris/HCl (pH 7.5) before 
being taken up in 25 ml BPER cell lysis cocktail (Thermo Scientific). 
Lysozyme (0.6 mg ml~^) and a crystal of DNase I were added and the 
suspension was stirred at room temperature for 20 min. Debris and 
unbroken cells were removed by centrifugation at 20 °C before the 
supernatant was loaded onto a 5 ml HisTrap FF Crude column (GE 
Healthcare) equilibrated in 50 mM Tris/HCl (pH 7.5), 200 mM KCl, 
50 mM imidazole and 1 mM DDT. Bound proteins were eluted with a 
40 ml gradient of 50-1000 mM imidazole in the same buffer. 

For electron microscopy, freshly isolated protein was adjusted to a 
concentration of 0.05 mg ml~^ before being applied to carbon-coated 
copper grids and stained with 2 % (w/v) uranyl acetate. Micrographs 
were collected at the SULSA Electron Cryomicroscopy Facility at the 
University of Edinburgh. 

SDS-PAGE was performed as described (Laemmli, 1970) and Western 
blotting was according to Towbin et al. (1979). Antisera to the His 
and haemagglutinin (HA) tags were obtained from Qiagen, and the 
anti-GFP antibodies were from Life Technologies. Protein identifica- 
tion was performed at cost to the project by Fingerprints Proteomics 
Service (Dundee). 

Fluorescence microscopy. Cells from 1 ml samples of stationary- 
phase cultures were pelleted by centrifugation, washed with 1 ml of 
PBS and resuspended in 100 \il of PBS. Cells were then fixed by 
addition of 150 ml of 4% (v/v) paraformaldehyde (in PBS) and 
incubated at room temperature for 10 min. Cells were the pelleted by 
centrifugation, washed twice, resuspended in 1 ml of PBS and 
mounted in Hydromount (National Diagnostics). The cells were 
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imaged on a Zeiss LSM 700 laser scanning microscope using a x 100 
Plan-Apochromat objective (numerical aperture 1.46), an optical 
section thickness of 0.7 jim and using the settings optimized for 
EGFP. 



RESULTS 



Predicting the size and geometry of a BMC 

In enteric bacteria such as Salmonella and Citrobacter 
freundii, well- characterized BMCs are used in propanediol 
utilization, with the rationale for compartmentalization 
being the protection of the cell from a potentially toxic 
aldehyde intermediate (Kerfeld et al, 2010). The other 
main biochemical role for a BMC is considered to be the 
concentration of reactants relative to the total cell volume. 
Simple geometric principles can be used to predict the 
physical properties of a BMC and to determine its volume. 
The Salmonella Pdu BMC is thought to be broadly similar 
in structure to the carboxysome (Frank et ah, 2013), given 
the conserved nature of the predicted shell proteins, and 
thus it is possible that a Pdu-based BMC will assemble to 
form something close to an icosahedral structure. 

A regular icosahedron is a polyhedron composed of 20 
pyramids, the bases of which are equilateral triangles. 
Placing all the vertices of these pyramids at one point (the 
centre) ensures that the base triangles tile to form the outer 
surface of the icosahedron. Thus an icosahedron has 20 
faces, 30 edges and 12 vertices on its surface. If the length of 
the edge of a face triangle is denoted by a, for a regular 
icosahedron there is a simple relationship between a and 
the radii of the spheres (Ri^, R^n and R^) that, respectively, 
inscribe (is tangent to all of the faces of a regular 
icosahedron), pass through the middle of each surface 
edge or circumscribe (pass through all of the vertices of an 
regular icosahedron) as follows: 

^i= — {3^^3 +aA15 )a< /?m= 7(1 + V5)«^< 
12 4 




10 + 2^^5 a 



It is known that BMCs vary in size from -100 to 200 nm 
across (carboxysomes are reported as -120 nm across, i.e. 
-60 nm in radius) and it is reasonably assumed that the 
BMC considered here is of a similar size to the 
carboxysome. These measurements are not sufficiently 
accurate to be associated precisely with one of the radii 
defined above. Hence, taking the middle value and setting 
R^=60 nm yields an edge length a of -74.16 nm. The 
volume of the icosahedron, V, can therefore be obtained by 
calculating the volume of a single pyramid and multiplying 
by the number of pyramids (n=20): 

— (3 + ) a' = 8.9x lO'nm' = 8.9 x 10"' ^m' 



A predicted volume for a BMC of 8.9 x 10 ^ |im^ is -1000 
times smaller than that of a single cell of the E. coli chassis, 
which has a volume of -1 |im^. This in turn suggests that 
any reaction that can be housed within a BMC could enjoy 
an increase in efficiency by several orders of magnitude. 
Thus, producing a synthetic BMC that could be used to 
enhance a wide range of biochemical reactions could have 
broad appeal for biotechnology applications. 

Designing and assembling the components of a 
synthetic Pdu microcompartment 

In this work it was decided, due to the close evolutionary 
relationship with the proposed chassis organism E. coli, to 
engineer the Pdu system from Salmonella. The entire 
Salmonella pdu gene cluster contains 18 genes; however, 
previous studies of the C. freundii system have established 
that just five of these proteins (PduA, -B, -J, -K, and -N) 
are sufficient to build an empty BMC, while a further two 
(PduT and -U) also have non-essential structural roles 
(Parsons et ah, 2010). Here, a series of constructs were 
assembled in order to produce these seven shell proteins in 
E. coli (Fig. la). In each case the pUNI-PROM plasmid was 
chosen, which is a derivative of pT7.5 carrying the 
constitutive tat promoter from E. coli (Jack et ah, 2004), 
the idea being that the constitutive promoter would allow 
constant gene expression in any host strain, but that 
overexpression could be induced in strains carrying the T7 
RNA polymerase if so desired. Genes that occurred 
together as natural transcription units on the Salmonella 
chromosome (pduAB, pduJK and pduTU) were cloned 
together with engineered RBSs upstream of the 5' ends and 
engineered sequences encoding hexa-His affinity tags at the 
3' ends of the bicistronic units (Fig. la). The pduN gene 
was cloned in isolation into pUNI-PROM (Fig. la). 
Naturally occurring Pstl restriction sites were silently 
removed in order to comply as far as possible with current 
standards suggested by the Registry of Standard Biological 
Parts. Next, progressively larger synthetic operons were 
built until the final seven-gene operon encoding PduAB^^^ 
PduTU^^ PduN^^' and PduJK^^' was completed (Fig. la). 

To test whether each of the seven shell proteins encoded by 
the bank of pdu constructs could be produced stably in an 
E. coli chassis, the phage T7 promoter present on the 
pUNI-PROM plasmid was exploited. The E. coli K-38 
(pGPl-2) strain, which produces T7 polymerase, was 
transformed with the synthetic constructs. Specific label- 
hng of the plasmid-encoded gene products with 
P^S]methionine/cysteine followed by SDS-PAGE and 
autoradiography revealed clear radiolabelled protein bands 
for almost all of the Pdu proteins (Fig. 2). Most of the 
proteins migrated close to their predicted molecular masses 
(Figs la and 2). Upon overexposure of the samples, two 
forms of the PduB protein were observed, corresponding to 
two different translation initiation sites, and this has been 
reported previously (Havemann 8c Bobik, 2003). The only 
protein that did not appear to be produced stably in this 
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^^Lk PduA . PduB^'s 

^^^^^ 

v^PduJ. PduK^ 
_^ PduN^\ 

^ PduT . PduU^'s. 

^ > \ 

^ PduA . PduB^'s PduT . PduU^'\ 

V PduA . PduB^'s. PduT PduU^'\ PduN^\ 

^ N ^"^^^^> ^y ^^ ^ 

V PduA . PduB^'s PduT . PduU^\ PduN^\ PduJ . PduK^'s 

^ >^=>' >=>^#H==>^=> 

(b) 

1^ ^ 1-40 ^ 1^0 

PduD PduD SsrAtag p^^^ 



PduA 


9 592 


PduB^'s 


24 824-28 839 


PduJ 


9 068 


PduK^'s 


16 794-17 629 


PduN^'s 


9917 


PduT 


19150 


PduU^'s 


13 299 



Fig. 1. Design of a synthetic operon encoding a BMC and a fluorescent reporter system, (a) Cartoons of the constructs used for 
production of the BMC shell proteins. Transcription can be driven by the constitutive E co// tat promoter or phage T7 promoter 
(bent arrow). Natural transcriptional and translational coupling is maintained for pduAB, pduJK and pduTU. The positions of 
synthetic RBSs are indicated by ovals. The respective names of the gene products are indicated above the arrows and the inset 
table gives the predicted molecular masses (MW) of those proteins. The pduB gene is known to be translated with two 
alternative initiation sites, one of which overlaps with the pduA stop codon whilst the other is in-frame but 1 1 1 bp downstream 
(Havemann & Bobik, 2003). Similarly, the pduKgene has two possible translation initiation sites. An AUG codon is present 3 bp 
downstream of the pduJ stop codon but, perhaps more likely, is an alternative GUG initiation codon that is found in-frame but 
21 bp downstream and is preceded by a plausible RBS. (b) Cartoons of constructs for targeting reporter proteins to the 
synthetic BMC. Genetic fusions were made to the N-terminus of the PduD protein and proteolysis or epitope tags are included 
where indicated. Transcription is driven solely by the E. coli tat promoter. 



system was PduK which was visible only as an extremely 
faint band (Fig. 2). 

PduD-GFP fusion generates bright foci when co- 
expressed with the BMC components 

Having established that the synthetic operon could 
produce the required BMC shell proteins, the next step 
was to co-express with potential cargo proteins. A fusion 
protein was designed whereby the N-terminal 20 aa of the 
propanediol dehydratase PduD, which during the course of 
this work have been shown to function as a BMC targeting 
sequence for this protein (Fan &: Bobik, 2011), were fused 
to a GFP reporter. The PduD2o-GFP fusion protein was 
produced under control of the constitutive tat promoter 
from a plasmid with a P15A origin of replication 
(Bartolome et al, 1991) and was therefore compatible for 
co-expression with the pUNI-ABTUNJK construct encod- 
ing the BMC. 

First, the E. coli chassis was transformed with the plasmid 
encoding the PduD2o-GFP fusion alone. Confocal microscopy 
analysis revealed GFP-dependent fluorescence distributed 
uniformly throughout the cell (Fig. 3a). By contrast, only 




Fig. 2. Testing protein production from synthetic constructs in E 
coli. Total cellular proteins were prepared from small-scale cultures 
of K38 (pGP1-2) containing the following plasmids: pUNI-AB 
CAB'), pUNI-JK CJK'), pUNI-N ('N'), pUNI-TU (TU'), pUNI-ABTU 
CABTU'), pUNI-ABTUN ('ABTUN') and pUNI-ABTUNJK 
('ABTUNJK'). Plasmid-encoded gene products were radiolabelled 
and separated by SDS-PAGE on a 15% (w/v) polyacrylamide gel. 
Protein bands were visualized by autoradiography. The asterisk 
shows the position of a faint band corresponding to PduK^'^ 
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when PduD2o-GFP was co-expressed with the synthetic 
operon encoding the BMC shell proteins did clear foci of 
GFP fluorescence become visible, consistent with the 
hypothesis that GFP was being targeted to, or clustered 
around, a new subcellular structure encoded by the pUNI- 
ABTUNJK construct (Fig. 3b). It was also notable the BMC- 
producing cells appeared elongated, perhaps filamentous, in 
morphology in this co-expression experiment (Fig. 3b). Note 
that it is not possible to conclude fi-om these data that 
PduD2o-GFP is inside a fuUy formed BMC. 

Cargo proteins co-purify with the shell proteins of 
the BMC 

Some of the synthetic BMC components (PduB, PduK, 
PduN and PduU) were engineered with hexa-His affinity/ 
epitope tags (Fig. la). This feature was exploited to 
establish if the PduD2o-GFP cargo protein was associated 
with the BMC components in the co-expression experi- 
ment (Fig. 3). The E. coli chassis was transformed with 
pUNI-ABTUNJK and pSU-PduD2o-GFP, and fermented 
overnight in a glucose -supplemented rich medium. Here, 
expression of both BMC and cargo was under control of 
the constitutive tat promoter. A crude cell extract was 
prepared and subjected to IMAC, and proteins were 
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Fig. 3. A PduD-GFP fusion generates bright foci when co- 
expressed with the BMC components, (a) GFP fluorescence 
signals from E. coli MG1 655 containing the plasmid pSU-PduD2o- 
GFP only. This plasmid encodes a covalent fusion between the 
initial 20 residues of PduD and the GFP. Left panel: bar, 10 |am; 
right panel: bar, 2 |am. (b) GFP fluorescence signals from E. coli 
MG1655 co-expressing the plasmid pSU-PduD2o"GFP together 
with the plasmid pUNI-ABTUNJK encoding the synthetic BMC. 
Left panel: bar, 10 |am; centre and right panels: bar, 2 |am. 



specifically eluted with an increasing imidazole concentra- 
tion gradient before being further analysed by SDS-PAGE 
and Coomassie staining (Fig. 4a). Interestingly, if the eluted 
protein sample was not heat denatured three major protein 
bands were detectable that migrated with apparent 
molecular masses between 50 and 75 kDa (Fig. 4a). 
Tryptic peptide mass fingerprinting analysis of these bands 
suggested only one Salmonella protein was present here 
and it was PduB^^^. However, when the same sample was 
heated to 100 °C for 2 min before SDS-PAGE the high- 
molecular-mass PduB^^^-containing bands were no longer 
detectable and were replaced by two clear bands of 
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Fig. 4. A non-tagged GFP reporter co-purifies with BMC 
components. E. coli MG1655 co-expressing the plasmid pSU- 
PduD2o-GFP together with the plasmid pUNI-ABTUNJK encoding 
the synthetic BMC was anaerobically cultured overnight in LB 
medium supplemented with 0.4% (w/v) glucose. Cells were 
harvested and broken by a chemical cocktail before the crude 
extract was loaded on to an IMAC column, (a) Protein fractions 
eluted from the IMAC column were pooled, and analysed by SDS- 
PAGE and Coomassie staining. Indicated bands were identified by 
tryptic peptide mass fingerprinting. MW, molecular mass, (b) 
Pooled fractions from the purification protocol were analysed by 
SDS-PAGE and Western immunoblotting using an anti-GFP 
antibody. 
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-25 kDa, closer to the predicted masses of native PduB 
(Fig. la). Tryptic peptide mass fingerprinting confirmed 
that these 25 kDa bands were indeed PduB^^^ and also 
identified two tryptic peptide masses derived ft-om the 
GFP, giving an initial indication that the reporter had co- 
purified. The remaining protein bands present in the 
sample were also identified by tryptic peptide mass 
fingerprint analysis, which gave unequivocal confirmation 
that the remaining six Pdu proteins, including PduK, were 
present in the column fi-action. Given the mild, non- 
denaturing, conditions used throughout this purification 
protocol, this experiment provides reasonable evidence 
that the BMC component proteins are soluble within the 
bacterial cytoplasm. 

Negative-stain electron microscopy was used to analyse the 
isolated microcompartment proteins (Fig. 5). The grids 
showed a range of irregular particles, most of which were 
<100 nm in size, which is similar to what is expected for 
native Pdu microcompartments (e.g. Cheng et al, 2008; 
Sinha et al. 2012). K low number of particles adopted more 
regular shapes (Fig. 5), more akin to carboxysomes isolated 
from cyanobacteria (lancu et al, 2007). 

In order to localize the PduD2o-GFP reporter protein in 
this experiment, the crude cell extract, together with 
samples of the unbound column flow-through and the 
peak nickel fraction containing the BMC components 
(Fig. 4a), was analysed by SDS-PAGE and Western 
immunoblotting using a GFP-specific antibody (Fig. 4b). 
Although some GFP was not retained by the metal affinity 
column and was found in the flow- through fraction (Fig. 
4a), a proportion of non-tagged GFP was clearly eluted 
along with the bound BMC proteins (Fig. 4b). 

To investigate whether another reporter could also be co- 
purified with the synthetic BMC components, the N-terminal 
20 aa of PduD were genetically fused to mCherry that had 




Fig. 5. Negative-stain electron microscopy of isolated BMC 
proteins. Electron micrograph of isolated microcompartment 
proteins following IMAC. Bar, 200 nm. Particles showing more 
regular shapes are highlighted by the arrows. 



also been supplied with a C-terminal HA epitope tag for 
immunodetection. E. coli was co-transformed with pSU-D2o- 
mCherry^^ and pUNI-ABTUNJK, the culture was fermented 
with 0.4 % (w/v) glucose, and the BMC proteins were purified 
by IMAC under non- denaturing, mild conditions. In this 
case, instead of pooling the fractions that were eluted with 
increasing imidazole, the fractions across the elution peak 
were analysed separately by SDS-PAGE and Western 
immunoblotting (Fig. 6). It is clear that the protein 
composition varied across the peak, with the earlier fractions 
containing an excess of PduB^^^ relative to the other Pdu 
components (Fig. 6a). Interestingly, PduD2o-niCherry™ was 
immune-detected in aU of the fractions eluting from the 
column (Fig. 6b), but the concentration of antigen was 
greatest in the fractions that eluted at the highest imidazole 
concentration, indicative of co-elution of the non-tagged 
PduD2o-mCherry™ with the His-tagged BMC proteins. 
These data indicate that some of PduD2o-niCherry™ is 
tightly associated with at least one component of the BMC. 



^ [Imidazole] 

kDa ^^^^^^^^^^ 

75-^ 

50 




Fig. 6. A non-tagged mCherry reporter co-purifies with BMC 
components. E coli MG1655 co-expressing the plasmid pSU- 
PduDso-mCherry^'^ together with the plasmid pUNI-ABTUNJK 
encoding the synthetic BMC was anaerobically cultured overnight 
in LB medium supplemented with 0.4% (w/v) glucose. Cells were 
harvested and broken by a chemical cocktail before the crude 
extract was loaded on to an IMAC column and bound proteins 
eluted with an imidazole gradient, (a) Individual protein peak 
fractions eluted from the IMAC column during application of an 
imidazole gradient were collected, boiled, and analysed by SDS- 
PAGE and Coomassie staining. Indicated bands were identified by 
tryptic peptide mass fingerprinting, (b) The identical individual 
protein peak fractions as shown in (a) were analysed by Western 
immunoblotting using an anti-HA mAb. 
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Protease protection assay for assessing BMC 
function in vivo 

The evidence presented so far points to the co-expressed 
PduD fusion proteins being associated with the compo- 
nents of the synthetic BMC. Next, it was important to 
perform an experiment that could address whether cargo 
proteins might be located inside the BMC lumen. Thus, an 
additional construct was prepared where a PduD4o-GFP 
fusion was modified at its C-terminus by addition of an 
SsrA proteolysis tag. Under natural conditions a C- 
terminal SsrA peptide is added to endogenous E. coli 
polypeptides on stalled ribosomes by the transfer mRNA 
(Keiler et a/., 1996; Komine et ah, 1994), which then targets 
any so -tagged polypeptide for rapid degradation by the 
ClpAP proteolytic machinery (Karzai et al, 2000). 

E. coli was transformed with plasmids encoding either 
PduD4o-GFP^''^^ alone, the BMC shell proteins alone or the 
PduD4o-GFP^^'''^ fusion protein simultaneously with the 
shell proteins (Fig. 7). All three strains were grown under 
identical conditions and analysed by Western immuno- 
blotting (Fig. 7). When PduD4o-GFP^''^ was produced in 
the absence of the BMC proteins the GFP was unstable, 
presumably because it was rapidly degraded by the ClpAP 
machinery (Fig. 7). By contrast, the PduD4o-GFP^^^'^ fusion 
protein was obviously stabilized when it was co-produced 
with the BMC shell proteins (Fig. 7). This result is 
consistent with PduD4o-GFP^^'''^ fusion being protected 
from proteolysis when co-expressed with the BMC genes. 
One interpretation of these data is that the GFP may be 
inside the BMC and thus shielded from the cytoplasmic 
ClpAP system. 



PduD4o-GFPSs^^ + _ + 
PduABTUNJK ^^^^^^ 

Anti-His ^L^^^-^l ^ PduK^'s 




Fig. 7. Protease protection assay for assessing BMC function in 
vivo. E. go// MG1 655 expressing either the plasmid pSU-PduD4o- 
QppSsM Qi^iy^ ^i^g plasmid pUNI-ABTUNJK encoding the synthetic 
BMC only or both plasmids together were cultured aerobically in 
LB medium before being harvested and resuspended in Laemmli 
disaggregation buffer. The whole-cell samples were then sepa- 
rated by SDS-PAGE and analysed for the presence of His-tagged 
proteins (the BMC components) or GFP (the cargo protein). 



DISCUSSION 

The synthetic BMC characterized here was based on the 
Salmonella Pdu system and seven genes from that BMC- 
encoding gene cluster were organized into a plasmid-borne 
synthetic operon. All seven gene products were successfully 
synthesized from a single promoter; however, small-scale 
expression tests suggested PduK^^^ was produced at a lower 
level, or possibly as a more unstable polypeptide, than the 
others (Fig. 2). The difficulty in detecting radiolabelled 
PduK^^^ was surprising as this protein contains two 
methionines and six cysteines, and so should be expected 
to radiolabel well in this plasmid-based system. Note, 
however, that there is an arrangement of four cysteine 
residues within the C-terminal 30 aa of PduK, which may 
indicate that the protein binds an iron-sulphur cluster. 
Indeed, it has been observed that a novel intermolecular 
iron-sulphur cluster is formed between subunits of another 
of the Pdu proteins (Parsons et ah, 2008). If Salmonella 
PduK were to contain an iron-sulphur cluster then the 
protein might be destabilized in the small-scale expression 
system as native E. coli transcription is completely inhibited 
during the labelling procedure, which may affect function- 
ality of the native iron-sulphur cluster assembly 
machinery. Subsequent larger- scale protein purification 
experiments, in which host cell biochemistry was allowed 
to continue uninhibited, readily identified the PduK^^^ 
protein (Figs 4 and 6). 

Taken together, the GFP and mCherry co- purification 
experiments described here are consistent with the initial 
20 aa of PduD interacting strongly with at least one 
component of the synthetic BMC. Under mild, non- 
denaturing condition these non-tagged reporters were seen 
to co-elute with affinity- tagged shell components. This is 
consistent with studies of the natural substrates of the 
Salmonella Pdu system (Fan &: Bobik, 2011). Recently, the 
signal sequence of Salmonella PduP has been further 
investigated by alanine scanning mutagenesis, which has 
identified E7, 110 and L14 of PduP as being important for 
initial binding to PduA and subsequent encapsulation by 
the BMC (Fan et aU 2012). Analysis of the PduD N- 
terminus suggests E5, L8 and 112 would be the analogous 
important targeting residues in this protein. 

The synthetic BMC designed here contains hexa-His 
affinity tags on four of the seven shell subunits. This 
allows isolation of all of the BMC components, including 
those that were not tagged, and some non-tagged cargo 
proteins. As the PduA, PduJ and PduT proteins were 
not hexa-His-tagged but also co-purified with the other 
Pdu proteins following IMAC, it could be concluded 
that at least some of the native protein-protein interac- 
tions are maintained in E. coli strains expressing the 
synthetic pUNI-ABTUNJK construct. One problem with 
the synthetic system is that it is likely that the plasmid 
designed here is not producing the shell proteins at the 
correct physiological stoichiometry. This is evident in the 
PduD2o-niCherry^'^/BMC purification experiment where 
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PduD2o-niCherry™ was immune -detected in all of the 
fractions eluting from the column (Fig. 6b), but the 
concentration of antigen was greatest in the fractions that 
eluted at the highest imidazole concentration. The later 
fractions contain less PduB^^^, which is produced at the 
highest levels in this system, but the gel banding profiles 
later in the gradient are more similar to those observed for 
authentic Pdu BMCs, which appear to contain PduB at 
similar stoichiometry to the other shell proteins (Fan et at., 
2010; Parsons et al, 2008). It is possible that PduB is being 
produced in excess here and there is a greater proportion of 
intact, mCherry-loaded, synthetic BMCs in the later- 
eluting fractions (Fig. 6). The potential for a range of 
differently sized protein complexes to be formed, many of 
which may not be from intact BMCs, is evident in the 
negatively stained electron microscopy of the isolated BMC 
components (Fig. 5). Here, a range of differently sized 
particles and large protein complexes may be present (Fig. 
5). The protein purification experiments outlined here do 
shed some light on the behaviour of PduB^^^ as this can be 
seen to form heat-stable multimers (Fig. 4). Indeed, recent 
crystallographic analysis suggests this protein does form 
trimers (Pang et ah, 2012), which corroborates this work 
and at least suggests that individual component proteins 
are behaving as expected in this heterologous system. 

The strongest evidence presented here that a cargo protein 
may actually be targeted into a BMC expressed from the 
synthetic operon comes from the protease accessibility 
assay, where PduD-GFP^^^"^ was rescued from destruction 
by the ClpAP bacterial proteasome by co-expression of the 
synthetic BMC operon (Fig. 7). However, a similar result 
may be expected if the PduD-GFP^^^"^ protein were to form 
insoluble aggregates and, indeed, the foci observed in the 
fluorescence microscopy experiment (Fig. 3) could also be 
interpreted as aggregation. The IMAC co-purification 
experiments (Figs 4 and 6) allow an argument to be made 
against inclusion body or aggregate formation, however, as 
they would be recalcitrant to purification using this non- 
denaturing technique. 

Potential applications 

The mathematical calculations outlined in this work 
suggest that the internal volume of a BMC could be 1000 
times smaller than that of the bacterial cytoplasm. Thus, 
the local concentrations of any chemical reactants that 
could be completely housed within a BMC would be up to 
1000 higher than in the cytoplasm of the cell. For example, 
the law of mass action determines that the reaction rate of a 
simple, second-order reaction is proportional to the 
product of the concentrations of the two reactants. Thus, 
if the concentration of both reactants is increased by a 
factor of 1000, then potentially this yields an increase in 
reaction rate by a factor of 10^. The potential increase in 
the reaction rate in the BMC is therefore very significant, 
especially if multiple enzymes in a single pathway can be 
CO -concentrated within a single BMC. 



Harnessing an empty BMC and concomitant targeting 
sequence has great potential for biotechnological applica- 
tions (Frank et ah, 2013). Using a living chassis for in vivo 
applications may help increase the efficiency of certain 
chemical reactions (e.g. boosting hmonene production by 
encapsulating recombinant limonene synthase with a 
source of geranyl pyrophosphate may be helpful in 
bioenergy research). Indeed, in the course of this work a 
plasmid encoding a PduD4o-hmonene synthase fusion was 
constructed (Table SI, available in Microbiology Online), 
but not tested for activity here. The synthetic BMC could 
also be used in vivo to concentrate and sequester toxic 
compounds such as arsenic. In this regard, a plasmid 
encoding a fusion between PduD4o and metallothionein 
from Fucus vesiculosus, which is an excellent arsenic- 
binding protein (Ngu et al, 2009), has already been 
constructed (Table SI). 

In conclusion, this paper describes the characterization of a 
synthetic operon encoding BMC shell proteins that can be 
expressed in an E. coli host. With further development, a 
synthetic BMC has the potential to be packed with a variety 
of non-native cargo proteins, which may be useful for both 
in vivo and in vitro applications. 
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